SciELO - Scientific Electronic Library Online

 
vol.31 issue1Towards a framework for online information security applications development: A socio-technical approachParsing and analysis of a Xilinx FPGA bitstream for generating new hardware by direct bit manipulation in real-time author indexsubject indexarticles search
Home Pagealphabetic serial listing  

Services on Demand

Article

Indicators

Related links

  • On index processCited by Google
  • On index processSimilars in Google

Share


South African Computer Journal

On-line version ISSN 2313-7835
Print version ISSN 1015-7999

SACJ vol.31 n.1 Grahamstown Jul. 2019

http://dx.doi.org/10.18489/sacj.v31i1.674 

RESEARCH PAPERS

 

Computing research in South Africa: A scientometric investigation

 

 

Douglas A. Parry

Department of Information Science, Stellenbosch University, South Africa. dougaparry@sun.ac.za

 

 


ABSTRACT

Limited attention has been afforded to mapping the 'landscape' of South African computing research. Prior studies have considered singular sub-disciplines, publications, or publication types. Given the growing prominence of computing disciplines, it is necessary to identify the patterns of research production, publication, collaboration, and impact of South African computing research. This study presents a scientometric investigation in this regard. Through the analysis of data accessed from the Scopus citation enhanced bibliographic database, the investigation presents findings in relation to annual research production, institutional differences in outputs, topics, collaboration, and citation impact. While characterised by institutional differences, over the period considered, South African computing research output has increased at a greater rate than that of South African research at large. Additionally, despite accounting for a greater proportion of all outputs, conference papers account for a smaller proportion of citations relative to journal articles or book chapters. Corresponding to previous investigations, there exists a tendency towards applied computing topics in contrast to more theoretical topics. Finally, the collaboration network was shown to be particularly de-centralised with many researchers clustered around institutions. The findings are of interest to all researchers conducting computing or related research in South Africa.
CATEGORIES: · Applied computing ~ Digital libraries and archives · General and reference ~ Surveys and overviews

Keywords: South Africa, Computing Research, Scientometrics, Information Systems, Computer Science


 

 

1 INTRODUCTION

Over a decade ago Glass, Ramesh and Vessey (2004, p. 89) proclaimed the widespread notion of the "era of computing", remarking that the primary means of advancing computing as an academic discipline is through research. In South Africa, while a proud tradition of computing research stretches back to the 1970s (Kourie, 2010), computing and related fields, as academic communities, are small in comparison to developed countries (Brown & Tanner, 2008). Computing is broadly recognised in accordance with the proposed constitution of the South African Institute of Computer Scientists and Information Technologists (SAICSIT) to include: Computer Engineering, Computer Science, Information Systems, Information Technology and Software Engineering. While scientometric1investigations of specific aspects of South African computing research have been conducted (e.g., Brown & Tanner, 2008; Kotzé & Van Der Merwe, 2009; Sanders & Alexander, 2015; Turpin, 2018), no comprehensive investigation broadly considering the landscape of South African computing research has been conducted. Supporting the value of such investigations, specifically referring to Information Systems (IS) as an academic discipline, Lowry, Romans and Curtis (2004, p. 1) affirm that "where and how we publish are fundamental aspects of the identity of the IS field-reflecting our value systems, paradigms, cultural practices, reward systems, political hierarchy, and aspirations". The same can be said for computing research in South Africa on the whole. At present, while different interpretations of computing abound, such research is dispersed across numerous diverse institutions, departments, associations, academic backgrounds, and traditions. While studies have investigated the nature of computing research in other countries (e.g., Galliers & Whitley, 2007; Pouloudi, Poulymenakou & Pramatari, 2012; Sellitto, 2007), at present, only limited or anecdotal indications of the current shape of computing research in South Africa exist.

1.1 Objectives, Scope, and Outline

The objective of this study is to identify and describe the publication patterns in South African computing research. A scientometric approach involving the analysis of research output was adopted to approach the objective, with specific research questions presented in Section 3.1. Perron et al. (2017, p. 802) argue that "scientometrics is essential to helping academic disciplines understand various aspects of their research efforts". This includes considering the quality of research outputs, the productivity of researchers, the nature of research outputs, and the existence of collaboration networks, amongst other factors. Typically, scientometric investigations consider publication patterns within a specific domain, publication outlet, country, or institution. In this study, the analysis concerned the research outputs produced by researchers affiliated with South African universities for the period 2008 to 2017. In accordance with the Department of Higher Education and Training (DHET), a research output is understood as a report on an "original, systematic investigation undertaken in order to gain new knowledge and understanding" (Department of Higher Education and Training (DHET), 2015, p. 4). Four categories of output are recognised: journal articles, whole books, chapters in books, and published conference proceedings.

Given the broad and often divergent notions of what constitutes computing (Gruner, 2007), it is necessary to provide a brief definition to delineate the scope of this study. This task is hindered by the multifarious terms used to describe computing fields (e.g., Informatics, Computer Science, Information Systems etc.), the different domains in which computing is applied, and the sometimes arbitrary distinctions between computing sub-disciplines. Indeed, the Association for Computing Machinery (ACM) defines computing as "any goal-oriented activity requiring, benefiting from, or creating computers" (Shackelford et al., 2006, p.9). The inclusivity of this definition prevents it from providing a useful boundary for this study. A widely cited definition offered by the University of Edinburgh School of Informatics provides a more useful conceptualisation: "the central focus of Informatics is the transformation of information-whether by computation or communication, whether by organisms or artefacts" (University of Edinburgh School of Informatics, 2016). This definition, while still broad, emphasizes both the technical and the cognitive aspects of computing or informatics. Consequently, while the artefact of computation should, of course, be a focus of computing, a broader frame is necessary. Accepting that there is no one perfect conceptualisation for computing and, by extension, computing research, the widely adopted (and de facto standard) ACM Computing Classification System 20122 was adopted in this study. This classification system outlines 13 high-level categories broadly reflective of the gamut of the computing discipline, with each category containing at least four sub-levels.3

To follow, background to this study is provided by briefly reviewing five previous investigations of South African computing research. Thereafter, the specific research questions and data collection, processing, and analysis methodologies are outlined. Next, the findings of this investigation are presented in Section 4, followed by a discussion thereof in Section 5.

 

2 RELATED WORK

Five previous studies in this regard have been conducted. All five, however, focus on specific domains, publication sources, or types of publications. To provide background for the present investigation the contributions of each of these studies are briefly outlined.

Brown and Tanner (2008) investigated the international visibility of South African IS research. Considering journal publications in 50 top IS-centric journals for the period 2003 to 2007, these authors found that only 39 articles in 19 of the top-50 journals were authored by researchers affiliated with South African institutions. Noting the diminutive nature of this total, the authors considered South African-affiliated research published in a broader range of IS-centric journals. While no indication of totals was provided, Brown and Tanner (2008, p. 16) contend that "just as many, if not more articles were published outside of the top 50 ranked IS-centric journals". Three further trends during this period were noted. First, publications in higher ranked journals were typically conducted in collaboration with international researchers not affiliated with South African institutions. Second, seven key research themes were identified: (i) IS Management and Knowledge Management, (ii) IS and National Development, (iii) Education and Research, (iv) IS Projects and Systems Development, (v) Web and e-Commerce, (vi) Decision Making, and (vii) Mobile Applications. The third trend concerned institutional differences. A majority of publications considered were found to be attributable to researchers affiliated with just two institutions -The University of Cape Town (UCT) and Witwatersrand University (Wits).

In the following year Kotzé and Van Der Merwe (2009) considered the research foci of articles published in the South African Computer Journal (SACJ) from 1990 to 2008. Using the 1998 version of the ACM Computing Classification System4 Kotzé and Van Der Merwe (2009) classified all 344 articles into 11 categories, finding that research within only four categories -Computing Milieux (21.22%), Software (19.48%), Information Systems (16.57%), and Computing Methodologies (15.9%)- accounted for more than 73% of SACJ publications during this period. The researchers note that there had been a shift towards applied computing fields in contrast to research with a more theoretical direction. Supporting this, the most prominent sub-categories included: Management of Computing and Information Systems, Computers and Society, Computers and Education, and Software Engineering. Kotzé and Van Der Merwe (2009) also considered institutional differences in publication patterns, finding that researchers affiliated with three institutions -UCT, Wits and the University of Pretoria (UP)- accounted for a majority of publications.

More recently Sanders and Alexander (2015) considered computing-related doctorates produced at South African universities between 1978 and 2014. The researchers found that, over this period, 391 doctorates in computing fields have been awarded, increasing from 59 in 2000-2004 to 161 in 2010-2014. Sanders and Alexander (2015) note that this increase (173%) is greater than that observed for all doctorates in South Africa (125%) (Cloete, Sheppard & Bailey, 2015). As with previous investigations institutional differences in output were considered. In contrast to previous studies, however, the researchers did not identify any institutions in this analysis. Despite this, it was shown that six universities accounted for over 60% of computing doctorates in this period. To analyse research themes, Sanders and Alexander (2015) separated all doctorates into three categories -IS (104), CS (208), and 'other' (79)- only considering those in either IS or CS. For both domains, in the same manner as Kotzé and Van Der Merwe (2009), Sanders and Alexander (2015) used the ACM Computing Classification System5 to assign topics to each doctorate. For IS doctorates, professional topics (computing industry, management of computing and IS, history of computing, computing education, computing profession) were the most prominent category, followed by ICT for Development (ICT4D), education, enterprise computing, and adoption related topics. Corresponding to earlier findings indicating that research published by SACJ is shifting towards applied topics (Kotzé & Van Der Merwe, 2009), these prominent themes are all resemblant of applied aspects of computing research. For CS doctorates, model development and analysis topics were the most prominent category, followed by machine learning, artificial intelligence (AI), software creation and management, professional, human computer interaction (HCI), and algorithm design and analysis topics. Interestingly, Sanders and Alexander (2015) found there to be a considerable overlap in themes across IS and CS doctorates. For instance, computing education, software creation and management were evident across disciplines.

Extending from the prominence of research in ICT4D Turpin (2018) investigated the state of South African research outputs within this sub-discipline. Publications in three leading ICT4D journals6 were reviewed for the period 2006 to 2015 to identify articles authored by researchers affiliated with South African institutions. In the period considered 58 articles met this requirement. As with previous reviews of South African computing research, institutional differences were found, with UCT accounting for the largest proportion of ICT4D output in these three journals, followed by UP, Cape Peninsula University of Technology (CPUT), and Rhodes University (RU). To categorise publications Turpin (2018) developed her own classification system, identifying three research themes accounting for 73% of articles -method (impact assessment and monitoring & evaluation), access (connectivity and telecentres), and general business and SMEs. Additionally, the researcher investigated collaboration patterns, finding that of the 58 articles considered, only five (8.62%) were single-author papers, 28 (48.28%) were authored by researchers from the same institution, seven (12.07%) were authored by researchers from different South African institutions, and 18 (31.03%) were authored by researchers with at least one international affiliation among them. This indicates that, firstly, intra-institution collaboration was more prevalent then inter-institution collaboration and, secondly, when researchers collaborated between institutions, international partnerships were preferred to local partnerships.

In a more recent study, also considering ICT4D research, van Biljon and Naude (2018) investigated the collaboration patterns among researchers in this domain in South Africa. In the same manner as Turpin (2018), these researchers analysed collaboration patterns as indicated by publications in a basket of top ICT4D journals. While the same journals as the former study were used, a different time period for this analysis was adopted (2003 to 2016). As would be expected given Turpin (2018)'s results, the analysis indicated that collaboration in this domain was primarily intra-institutional, with limited inter-institutional collaboration between South African researchers. Following this, to investigate the drivers and barriers for research collaboration, van Biljon and Naude (2018) conducted a series of semi-structured interviews with 10 South African researchers shown to frequently collaborate within and between institutions. Drivers for collaboration were found to include: access to funding and resources, knowledge gains and building an international profile, while barriers were found to include: funding, time, absence of administrative support in managing grants, a lack of a collaborative history, and the incentives introduced by the funding model in South Africa.

From these five studies it is evident that, firstly, no comprehensive investigation of South African computing research has been conducted and secondly, that much of the research that has been conducted requires updating. At present, it appears that there are significant institutional differences in the pattern and extent of computing research in South Africa. Whether these differences extend beyond the scope of the studies considered requires further investigation. For both IS and CS research, while differences certainly exist, a shift towards more applied research topics was found. Whether this shift holds ten years later, and whether it extends beyond research published in a single journal or set of journals is unknown. Additionally, determining whether the growth in computing doctorates is reflected more generally in the growth of computing research requires further investigation. Only Turpin (2018) considered growth over time. This was, however, restricted to outputs in only three journals in a single sub-domain. There has been little consideration of collaboration networks. For IS, it was shown that publications in top-ranking journals typically involved collaboration with international researchers. Similarly, in ICT4D research, when researchers collaborated externally, international partnerships were preferred to local partnerships. More targeted analysis of collaboration networks among South African computing researchers is required to elucidate the pattern of collaboration within this field.

 

3 METHODOLOGY

To address the studyobjective an exploratory, descriptive, scientometric investigation ofSouth African computing research was conducted. The population of research considered is characterised by the affiliation of the researchers involved. Specifically, to be eligible for inclusion a research report must have been published between 2008 and 2017, have at least one author affiliated with a South African university, fall in one of the four categories of research output recognised by the DHET-journal articles (J), whole books (B), chapters in books (BC), and published conference proceedings (P) and, importantly, concern a relevant field of computing. To follow, four primary research questions are specified to guide the investigation. Following this, the procedures for data collection, processing, and analysis are outlined.

3.1 Research Questions

Building on previous research and to address the objective for the study, the following research questions were posed:

RQ1: What is the pattern of production of South African computing research over the period considered?

- RQ1a: Do institutional differences exist in terms of computing research output?

- RQ1b: How does the rate of growth in South African computing research compare to the rate ofgrowth ofall research in South Africa?

RQ1: Where do South African computing researchers publish their findings?

RQ2: What topics do South African computing researchers investigate?

RQ3: What is the pattern ofcollaboration amongst South African computing researchers?

RQ4: What is the impact, in terms ofcitations, ofSouth African computing research?

3.2 Data Collection

Relevant data for the period 2008 to 2017 were collected from the SCOPUS citation enhanced bibliographic database. This is a comprehensive database covering over 23700 titles from over 5000 publishers across a wide variety of academic domains (Elsevier, 2018). In addition to indexing academic output, the database contains related data on authors, affiliations, references, and citations, all of which are necessary for addressing the research questions. Referencing two prominent large-scale reviews, Boshoff and Akanmu (2018, p. 14) note that SCOPUS is often considered to be the most suitable source of data for bibliometric or scientometric analyses of African research. Moreover, at present, research outputs indexed in SCOPUS are accredited by the DHET.

Data were collected from the SCOPUS citation enhanced bibliographic database on the 3rd of September 2018. Consequently, all citation counts are valid up to this date and subsequent citations are not included in the analysis. The primary means of data collection involved the issuing of a search-string as a query to this database. This query contained three clauses. The first concerned institutional affiliations of the authors involved in producing a report. Through an 'OR' operator subclauses were specified for 23 universities in South Africa.7 The second clause concerned subject area. Of the 27 subject area categories represented in SCOPUS, the search was restricted to two categories, Computer Science (all) which includes 12 subcategories8 and Decision sciences (all) which includes four subcategories.9 The third clause concerned the time-period of the study, with publication years restricted to those between 2008 and 2017 (inclusive).

Additionally, two further queries were developed to address RQ1a and RQ1b. To determine the proportion of total South African research output attributable to computing domains, the 'subject-area' qualifier was removed from the original search string. All other components (institutions and years) remained the same. This query returned the number of research outputs, as indexed by SCOPUS, for each year in consideration. To determine the proportion of computing research attributable to researchers affiliated with different institutions the query was modified to only include one affiliation and then run for each university in South Africa.

3.3 Data Processing and Analysis

For each record returned the following fields were retrieved: author names (AU), title (TI), publication name (SO), ISO source abbreviation (JI), document type (DT), author-supplied Keywords (DE), keywords-plus10 (ID), abstract (AB), author address (C1), re-print address (RP), cited-references (CR), times cited (TC), publication year (PY), DOI (DI), and unique article identifier (UT). To decrease the size of the dataset the JI, RP DOI, and UT fields were removed. To identify and associate authors with their affiliations a new field, author-affiliations (AU_AF), was computed from the C1 field. Duplicate records were identified through the application of the restricted edit Damerau-Levenshtein distance to the titles with a similarity cut-off of 0.95. The main dataset was maintained separately from the individual datasets for each university.

The data were analysed from two perspectives. First, through considering the annual productivity of computing research, the publication types and sources, and the topics researched, the quantitative analysis of publication patterns were investigated. To investigate research topics keywords were analysed using a co-occurrence strategic mapping approach. As a means of understanding the impact of South African computing research, citation patterns were analysed from a quantitative perspective. The second analysis approach involved the application of network-analysis techniques. First, to further consider prominent research themes co-occurrence network graphs were produced. Second, to consider collaboration patterns, in addition to the quantification of authorship patterns, collaboration networks were analysed by examining the affiliation patterns of co-authors.

 

4 ANALYSIS AND RESULTS

The aforementioned query produced 11633 records. After removing duplicates (n = 296) the sample consisted of 11337 unique records across 12 document types-'conference paper' (n = 7000), 'article' (n = 4012), 'book chapter' (n = 277), 'editorial' (n = 114), 'erratum' (n = 11), 'review' (n = 124), 'conference review' (n = 1), 'book' (n = 26), 'note' (n = 21), 'letter' (n = 6), 'short survey5( n = 3), and 'article in press' ( n = 38). To more closely approximate the DHET research output categories, only records from the 'article' (JA), 'book' (B), 'book chapter' (BC), 'conference paper' (CP), 'conference review' (CR), and 'review' (R) document types were considered. This resulted in 157 records being discarded from the dataset, leaving a sample of 11180 records. Additionally, records in the review category were considered as articles, while conference reviews were considered as conference proceedings. In the following sub-sections the analysis of the data and the results thereof are presented. First, in Section 4.1 an overview of South African computing research outputs is provided. Following this, publication patterns are considered in Section 4.2. In Section 4.3 the analysis concerns the research areas considered by South African computing researchers. In Section

4.4 the collaboration patterns within this domain are addressed and, finally, in Section 4.5 the impact of South African computing research is considered.

4.1 Overview of Annual Research Outputs

Table 1 provides a summary of the output for all South African research and, specifically, South African computing research. Additionally, the annual percentage change for each category of research is presented as %Δ. Over the ten year period, annual computing research output has increased by 235.85%, while overall South African research output has increased by 172.61%. It is interesting to note that, while computing research output increased in most years, in 2014 research output decreased by 11.94% from the previous year. Despite this, computing research still accounted for over 6% of the total South African research output for 2014. A compound annual growth rate (CAGR) was computed to describe the average rate of growth in South African computing research and South African research overall, for the period 2008 to 2017. As with computing doctorates, with a CAGR of 12.88% South African computing research output has increased at a rate greater than that of total South African research output (a CAGR of 11.78%). Consequently, over the period in consideration, the proportion of South African research attributable to the computing field has grown from 8.99% in 2008 to 11.07% in 2017. Over the ten year period, 8.86% of all South African research could be categorised as computing research.

 

 

4.1.1 Institutional Differences in Annual Research Outputs

Extending from the literature reviewed, an analysis was conducted to determine if there exist institutional differences in overall research output in this domain. For this analysis, the affiliations (restricted to those of South African universities) of all authors were considered. This produced a total of 12362 affiliation-linked-outputs. Figure 1 depicts the yearly research output for each institution considered. It must be noted that, as a consequence of inter-institutional collaboration, the total represented in Figure 1 is greater than that of the total considered in the analysis (a report is counted equally for each institution for each listed author, with some authors listing multiple affiliations). Nonetheless, such a figure provides an indication of the proportion of research in this domain contributed by those associated with the various institutions. Overall, it is apparent that productivity (as represented by yearly output) differs dramatically by institution. Considering the distribution, four institutions (UP UCT, UJ, and UKZN) account for 52.43% of all affiliation-linked-outputs, with researchers affiliated with UP producing the largest proportion in the period considered (16.52% of all affiliation-linked-outputs).

4.2 Publication Sources For Computing Research

Figure 2 summarises yearly research output. In addition to the overall growth trend, the dominance of conference papers as a publishing medium is clearly evident. Overall, conference papers (including conference reviews) account for 61.40% of all South African computing research in the period considered. With 36.10% of outputs, journal articles (including reviews) account for the second largest proportion, followed by book chapters with 2.33% and books with 0.23%.

 

 

To identify the most prominent publication sources the reports were aggregated by source. Indicating a significant degree of concentration, the distribution of publication sources followed a power law (a = 2.085). However, prior to an in-depth consideration of publication patterns, irregularities in the data required cleaning. To account for sources appearing multiple times under different names this aggregate was sorted and the top 20% of sources (n = 436; accounting for 76.26% of all outputs) were inspected. Consequently, several instances of differential naming were identified.

Therefore, through pattern-matching, differential entries were identified and amalgamated. This resulted in the 2179 unique sources being reduced to 1770 unique sources. Again, the distribution of publication sources followed a power law (a = 1.83), with the top 20% of sources accounting for 78.33% of outputs. Indicating a considerable degree of concentration, 60 publication sources (3.39%) account for 50.00% of all research outputs in this domain. While not mathematically conforming to the conventional Bradford's Law of Scattering 11 seen in other domains, this distribution does approximate a degree of Bradford Distribution around a small primary zone of publication sources (n = 20) accounting for a third of outputs, with a second, larger zone (n = 145), accounting for another third, and then a considerably larger zone (n = 1606) of rarely used sources accounting for the final third.

Table 2 summarises the 25 most prominent sources. Collectively, these accounted for 37.00% of all outputs. Reiterating the dominance of conference proceedings in this domain, only seven of the top-25 publication sources represent journals, while the remaining 18 are conference proceedings. When considering the table there are a few factors to note. First, the source-'ACM International Conference Proceedings Series' represents the proceedings of the Annual Conference of the South African Institute of Computer Scientists and Information Technologists, SAICSIT'. Second, as of 2016, the proceedings of the 'Annual Conference of the Southern African Computer Lecturers' Association, SACLA' are published through Springer Verlag as part of the 'Communications in Computer and Information Science, CCIS' Series. Third, the absence of SACJ in this list is due to the journal only being indexed in SCOPUS since the 25th of March 2016. Consequently only volumes published subsequent to this date (vol 28 onwards) were considered in this analysis. Finally, of the top-25 sources, 11 (44%) are African in origin, while 14 (56%) are International.

4.3 Topics Researched By South African Computing Researchers

To understand the conceptual structure of South African computing research the author-supplied and keywords-plus (those supplied by SCOPUS) were considered. Across the dataset 23857 keywords and 52092 keywords-plus were present.

Table 3 presents the ten most frequently used author-supplied and SCOPUS-supplied keywords. As would be expected, the most commonly used keyword is 'South Africa', appearing in 266 author-supplied lists and 399 SCOPUS-supplied lists. These frequencies, however, indicate that only 2.59% of records use this keyword and only 3.88% have had this keyword assigned by SCOPUS. Together with the frequencies of the remaining prominent keywords, this indicates a significant degree of diversity in research themes considered. Despite this, as with publication sources, the distribution of author-supplied keywords followed a power law (a = 2.63), with the most prominent 10% of keywords accounting for 50% of all keyword use-instances. Additionally, while distinct keywords have been used, ICT4D, education, and security-related themes are prominent.

 

 

4.3.1 Analysis of Keywords: Co-occurrence Strategic Mapping

To further understand the conceptual structure of South African computing research co-word analysis of author-supplied keywords was conducted for both journal articles and conference proceedings (separately). Through clustering of keywords into thematic-networks and mapping their interconnections prominent themes were identified. Two metrics are used to describe each theme -Callon's density (referred to as density) and Callon's centrality (referred to as centrality) (Cahlik, 2000; Callon, Courtial & Laville, 1991). Density measures the internal strength of a network of keywords characterising a theme (the strength of internal ties within themes). In the context of research outputs density represents the ongoing development of a research theme. Centrality measures the degree of interactions between networks (the strength of external ties between themes). c = 10 χ ^ekh, where k is a keyword from the particular theme and h is a keyword in another theme (Callon et al., 1991). In the context of research outputs centrality represents the importance of a research theme to the domain in question. Median values, for each theme and metric, can be mapped in a two-dimensional space to form a Strategic Diagram. This enables the identification of four classes of themes. Figure 3 depicts the thematic map for South African computing research published in conference proceedings and Figure 4 depicts the thematic map for journal articles. In such diagrams each thematic-network is labeled by the keyword most central to the theme and the size of the theme is proportional to the number of records associated with it (Cobo, López-Herrera, Herrera-Viedma & Herrera, 2011).

 

 

 

 

Themes in the upper-right quadrant, termed motor-themes are characterised by high density and high centrality and, therefore, are well developed and important to the domain. For conference proceedings, as is evident in Figure 3, such themes include: Information Security, Machine Learning, Classification, and Cyber Security, amongst others. For journal articles, such themes include: ICT4D, South Africa, Blended Learning and, again, Information Security. Themes in the upper-left quadrant are characterised by high density, indicating strong internal ties, but low centrality. For conferences only one theme falls in this quadrant, WiMax. For journal articles, such themes include: L-Filter12, Artificial Neural Networks, and Scheduling.

Themes in the lower-right quadrant have high centrality and are, therefore, important to the domain but, given their low density are underdeveloped. For conference proceedings, such themes include: Remote Sensing, Particle Swarm Optimisation, and South Africa. It is worth commenting on the presence of 'South Africa' as a theme in this quadrant. The high-centrality of this theme is indicative of widespread association between the keyword and other themes, as is to be expected with publications produced by South African researchers. And, in the same manner, because it comes to be used so widely, its density or internal strength is less secure. It is, however, close to falling in the upper-right quadrant(as it does for journal articles). For journal articles, themes in this quadrant include: Climate Change, Uncertainty and, again, Particle Swarm Optimisation. Themes in the lower-left quadrant exhibit low density and centrality and are, therefore, marginal to other themes and under-developed. This low density implies that such themes are either still emerging and broad or waning in prominence. For conference proceedings, such themes include: Mobile Learning, Information Systems, and OFDM13. For journal articles, such themes include: Wireless Sensor Networks, Teaching/Learning Strategies, and Decision Support.

Across the two publishing domains a number of patterns are evident. First, for conference proceedings, on average the 'size' of the themes is larger than for journal articles, which indicates a greater number of records associated with each theme. Second, in a related manner, more key themes were identified for journal articles than for conference papers. Third, for both domains a majority of themes fall in either the upper-right or lower-left quadrants, indicating a strong core of well-developed and central research themes existing in parallel with a body of more peripheral themes either gaining traction or declining in prominence.

4.3.2 Analysis of Keywords: Co-occurrence Network Graphs

Extending from this conceptual mapping, further analysis of the co-word networks for author-supplied keywords was conducted. For conference proceedings and journal articles separate network graphs were generated to understand the relations between prominent research themes. For the sake of brevity and legibility, these graphs are not reproduced in this report.14 Rather, as is typical, to understand the nature of these networks, summary statistics are provided. Specifically, for three key measures of network-centrality (degree, eigenvector, and betweenness), the most prominent themes were identified.15 Centrality measures enable the identification of the most important vertices (nodes representing research themes in this case) in the structure of the network (David & Jon, 2010). While values are computed for each vertex and measure, what is of interest is, rather, the ranking of all vertices (Nomikos, Pantazopoulos, Karaliopoulos & Stavrakakis, 2014; Okamoto, Chen & Li, 2008). The actual value is, for the purposes of this investigation, irrelevant (Georg & Rose, 2016).

The relative indexing of each vertex enables the identification of key vertices in the network. Such themes are, consequently, important to the overall conceptual structure of research in this domain.

The simplest measure of network structure is Degree Centrality, a metric accounting for the number of ties a vertex has to other vertices (Freeman, 1978). Vertices with a higher degree centrality have more connections to other vertices and are more central to the conceptual structure of the domain. Table 4 summarises the top-15 vertices by degree centrality. Given the frequency of 'South Africa' as a keyword, it is not surprising that it is the vertex with the highest degree centrality for both conference proceedings and journal articles. Notable are high-ranking keywords appearing in both domains -'ICT4D', 'Africa', 'Wireless Sensor Networks', 'Information Security', and 'Particle Swarm Optimisation'. Indicating a difference in overall graph structure, no other vertices appear in the top-15 for both publishing domains. Overall, the 15 themes represented in Table 4 are, for each publishing domain, the themes which link most to other research themes. These themes, consequently, play a hub role in the network connecting many other themes.

 

 

Extending from degree centrality, Eigenvector Centrality considers the extent to which vertices are connected to other vertices in the network by rewarding connections to highly central vertices (Bonacich, 1987). Closely related to the PageRank centrality measure developed by Page, Brin, Motwani and Winograd (1999), a vertex is considered important if it is linked to by other important nodes. Table 5 presents the top-15 themes by eigenvector centrality. While 'South Africa', as a keyword, appeared in the top-15 for journal articles, it did not possess a high eigenvector centrality for conference proceedings. Indicating different conceptual structures, there are no overlaps between the two publishing domains for themes with high eigenvector centrality. Of interest, is the presence of four countries in the top-15 for journal articles (South Africa, China, Ghana, and Zambia). This is, perhaps, indicative of associations between articles using prominent ICT4D keywords and the country of investigation.

 

 

The final measure of network structure, betweenness centrality, measures the number of times a vertex is located on the shortest path between other vertices (Freeman, 1978). Vertices with a high betweenness centrality connect different vertices in the network, linking thematic areas (Yan & Ding, 2009). While betweenness centrality is conventionally understood to indicate that a vertex links between or bridges other vertices, it can also indicate that a vertex or theme is on the periphery of both related thematic-clusters. For South African computing research, for both journal articles and conference proceedings, 'South Africa' is the vertice with the highest betweenness centrality. Corroborating the two other centrality measures, the themes making up the remaining 14 vertices differ substantially between the two publication domains. While there are overlaps (e.g., 'wireless sensor networks') this indicates differences in the conceptual structure of research published in journals and conference proceedings. Overall, the themes summarised in Table 6 are considered to be key to the flow of concepts through the network.

Overall, the properties of the separate co-word network graphs for author-supplied keywords for both conference proceedings and journal articles indicate a considerable degree of divergence and decentralisation, with both graphs characterised by a density of 0.000. While both networks are particularly decentralised, differences do exist between reports published as journal articles (degree centralisation = 0.049; betweenness centralisation = 0.204; eigenvector centralisation = 0.997) and reports published in conference proceedings (degree centralisation = 0.026; betweenness centralisation = 0.07; eigenvector centralisation = 0.992). These values support the outcomes of the strategic mapping which suggest that, while thematic clusters do exist, computing research in South Africa is particularly diverse and decentralised. Many thematic areas or research sub-domains exist in isolation of each other with little overlaps or linkages with other sub-domains.

4.4 Collaboration Patterns Among South African Computing Researchers

To address RQ3 collaboration patterns, as represented by co-authorship, were considered. For the 11180 articles identified, 14506 unique authors were found. In total the dataset contained 32871 author instances. This indicates that, on average, a record was produced by 2.94 authors. Of the 11180 records, 1495, representing 13.37% of all outputs, were authored by a single individual. On average, each author considered has produced 0.77 records.

4.4.1 Initial Analysis of Authorship Patterns

To provide an initial summary of collaboration among South African computing researchers two metrics are considered. First, degree ofcollaboration (C) was calculated to determine the ratio of collaborative research outputs (as indicated by co-authorship) to total research outputs. Following this, the collaboration index (CI) was calculated to determine the average number of authors per report for multi-authored reports. Table 7 summarises the key authorship statistics for the period under consideration. Overall, for South African computing research, on average, the degree of collaboration is 0.87. This indicates that computing research in South Africa is predominantly conducted collaboratively (less than 15% of papers are single-authored). Moreover, as indicated by the CI, on average multi-author papers are produced by 3.24 collaborators.

 

 

As would be expected given norms in scientific publishing the distribution of authorship followed an inverse power law (a = 2.27), with 9936 authors involved in only a single paper. To further apprehend the authorship patterns a two-sample Kolmogorov-Smirnov goodness-of-fit test was conducted to determine if Lotka's Law held for the sample in question. Lotka's Law describes the productivity of authors in a given domain (Lotka, 1926) and accounts for the distribution of publication productivity (Pao, 1986, p. 305). In accordance with Lotka's law 68.50% of authors have only a single publication, 14.50% have two publications, 5.45% have three publications, and 3.02% have four publications. At the other end of the distribution, collectively, the three most productive authors account for 429 (3.84%) publications between them, while the ten most productive account for 1011 publications (9.04%).16

4.4.2 Analysis of Collaboration Networks

After accounting for duplicates, 1435 unique international institutions from 110 countries were present. Overall, 5473 instances of international affiliations were identified. Two caveats necessary to acknowledge at this stage are, firstly, that many authors maintain and list multiple affiliations and, secondly, many authors hold both local and international affiliations. Table 8 provides a summary of the affiliation-totals for the countries and institutions for seven key regions. It is evident that a majority (63.42%) of international collaboration instances involve authors with affiliations in either Europe or North America. Only 12.42% of international collaboration instances are with institutions in other African countries. The ten most frequently collaborated with countries, outside of South Africa, are: the USA (899), the UK (566), China (407), Germany (373), Canada (253), The Netherlands (216), Nigeria (209), India (195), France (193), and Australia (180).

 

 

Of the top 200 international institutions (representing 56.20%) South African computing researchers collaborate with, 15.02% are African, 20.94% are North American, 12.45% are Asian and, the largest proportion, 44.28% are European. Table 9 provides a summary of the 20 most-collaborated with institutions. Six are in Africa, seven are in Europe, four in North America, two in Asia, and one in Australasia. Overall, considering the properties of the network graph for institutional collaboration (both local and international), the density of the network is low at 0.005. This indicates that the inter-institutional collaboration network is characterised by decentralised, isolated components with many collaboration networks existing in isolation from others. This is supported by values of the three centralisation measures considered for the network (degree centralisation = 0.25; betweenness centralisation = 0.25; eigenvector centralisation = 0.95). For all of degree, closeness, and betweenness centrality the top five nodes are: UCT, UP UKZN, WITS, and UJ, respectively. For Eigenvector centrality the top five is made up of international institutions. North-West University is the highest ranked local institution at eighth in the network.

For individual authors, the collaboration network was considered by examining patterns of co-authorship. As with institutional collaboration, the density of the network was low at 0.002. This is supported by values for the centralisation measures considered (degree centralisation = 0.03; betweenness centralisation = 0.03; eigenvector centralisation = 0.97). The computing research network (including both local and international collaborators) is characterised by isolated clusters of nodes containing small numbers of authors as well as individual, isolated authors.

4.5 Impact of South African Computing Research

Across the articles considered 52215 citations have been recorded in the the SCOPUS citation enhanced bibliographic database, with a mean of 4.67 (SD = 25.59) per article. Of the 11180 reports considered, 4382 (39.19%) had not been cited at the time of analysis. Of these, 2066 (47.15%) were published within two years of data collection. Considering just the articles cited at least once (6798; 60.81%), the mean number of citations moves to 7.68 (SD = 32.47), with a median of 3. These averages are affected by the presence of a number of highly cited articles. The 15 most cited articles together account for 7321 citations (14.02%), with the article with the most citations (van der Walt, Colbert and Varoquaux (2011)), itself, accounting for 1481 citations at the time of analysis (2.84%). Figure 5 provides a summary of the total citations received across all article types for each year considered. While the citation delay accounts, in part, for year-on-year differences, it is evident that, despite accounting for a larger proportion of reports published (61.40%), conference papers account for a much smaller proportion of total citations for each year considered. Overall, across the ten years considered, journal articles account for 38211 citations (73.18%), books account for 405 citations (0.78%), and conference papers account for 13599 citations (26.04%). For reports with at least one citation, again, there are differences in the yearly citation rate. For journal articles, on average, a report is cited a mean of 2.42 (SD = 6.68) per year since publication, whereas for conference papers, a report is cited a mean of 0.93 (SD = 1.50) times per year since publication.

 

5 DISCUSSION

Through the scientometric analysis ofrelevant publication data for South African computing research this study endeavoured to provide an overview of the publication patterns for the field and identify key shortcomings and areas of strength.

The analysis indicated that, over the period considered, South African computing research has increased year-on-year at a greater rate of growth than that of South African research at large. This outcome extends Sanders and Alexander (2015) who found that growth in computing doctorates grew at a rate greater than that of all doctorates in South Africa. This growth in computing related research maybe indicative of the increasingly important role of computing for individuals, society, and academia or, relative to other disciplines, it maybe indicative of differential publishing patterns. Given the nature of the database search, it may also be indicative of a move towards more interdisciplinary work with computing playing a key bridging role to many other related disciplines. As was the case with Brown and Tanner (2008), Kotzé and Van Der Merwe (2009), and Turpin (2018), the output of South African computing research is characterised by marked institutional differences. Specifically, for the outputs considered in this investigation, UP UCT, UJ, and UKZN account for a majority of South African computing research. In part this corresponds to previous investigations which have shown that, for prominent IS research between 2003 and 2007 UCT and Wits prevail while, for publications in SACJ between 1990 and 2008 UCT, Wits and UP account for a majority of publications and, for prominent ICT4D research, researchers from UCT, UP, CPUT, and RU are more productive. The broader scope of analysis, years considered, and data collection method may account for the slight differences with these previous studies. Nonetheless, it is evident that across computing sub-disciplines there exist at least three highly productive institutions in South Africa, followed by a larger group of moderately productive institutions and, behind that, a trailing group of relatively unproductive institutions. Of course, when considering these patterns it is necessary to acknowledge structural factors in the South African higher education landscape which, certainly, affect the nature and productivity of research in South Africa. Moreover, such outputs are also affected by staff size, teaching loads, available time for research, supervision capacity, funding, and differences in policies for the appointment of external research fellows and associates.

Considering where and how computing research is published it is evident that conference papers are far more prevalent than journal articles or book chapters. Moreover, approximating typical power law distributions for such data, publication of South African computing research, across sub-domains, is concentrated in a small number of sources. Considering the specific sources, the preponderance of general domain conferences and journals which attract and publish submissions from a wide variety of sub-disciplines and research areas potentially accounts for this outcome. While South African computing research is certainly published in sub-discipline specific journals or conference proceedings, it is evident from this analysis that, on the whole, a large proportion of research is published through general conferences and journals which are, by contrast, targeted to a broader cross-section of the field or, in some cases, multiple distinct fields.

To investigate the prominent topics and research themes considered by South African computing researchers keyword analyses were conducted. Before discussion of these results, it is necessary to acknowledge that authors use keywords for a variety of reasons and that such terms may hold different meanings in different contexts. Moreover, keywords can also be particularly ambiguous. For instance, terms such as 'algorithms', 'artificial intelligence', or 'South Africa' are broad and not necessarily indicative of the specific subject-matter. While previous studies have endeavoured to distinguish between IS and CS research themes, the analysis conducted in this study did not. Nonetheless, corresponding to Brown and Tanner (2008), Kotzé and Van Der Merwe (2009), and Sanders and Alexander (2015) the analysis indicated a tendency towards applied computing topics. This is perhaps also indicative of the widespread applicability of computing to other disciplines and research areas. It may, however, also reflect preferences or capabilities to address particular questions or problems. Additionally, while many overlaps existed in the conceptual structure, differences were shown between reports published as journal articles and those published as conference papers. This outcome may reflect different publishing norms across sub-domains. Moreover, the analysis indicated that, while thematic clusters do exist, computing research in South Africa is particularly diverse and decentralised. Many thematic areas or research sub-domains exist in isolation of each other with little overlaps or linkages with other sub-domains. While this outcome reflects the large variety in research themes addressed by South African researchers, it may also be reflective of the broad nature of the subject area categories covered by the database targeted. Additionally, further research is required to determine if this pattern is characteristic of computing research at large or if particular qualities of the South African computing field account for this finding.

For authorship, it is evident that a majority of researchers choose to work collaboratively, with a small minority of outputs produced individually. As is normal, authorship followed a power-law distribution, with a majority of authors involved in only a small number of papers. This outcome is suggestive of, firstly, once-off collaborations and, secondly, projects involving students who, upon completion, leave academia. Considering collaboration networks, it was shown that, outside of South Africa, a majority of collaboration involves researchers in either North America or Europe, with little collaboration with researchers in other African countries. Despite these trends, it is evident that the collaboration network is particularly diverse and de-centralised with many researchers or teams of researchers working isolated from the network at large. Such isolated clusters are centralised around institutions and subject areas. Given the broad nature of computing research and the many sub-domains considered in this analysis (e.g., IS, CS, Computer Engineering etc), such a pattern is not unexpected.

Finally, the analysis considered the impact, in terms of citations, of South African computing research. Notwithstanding the shortcomings of citations as pure indicators of impact (e.g., Belter, 2015; Costas & Bordons, 2007) the citation analysis indicated that a majority of articles had been cited at least once since publication. Interestingly, despite accounting for a greater proportion of all published reports, conference papers account for a much smaller proportion of citations relative to journal articles. This difference may be as a result of publication differences, with preliminary or early work published as conference papers and more complete, comprehensive reports published as journal articles. It may also reflect differences in availability between these two publishing media, with access to journal articles through publication databases being more widespread.

5.1 Recommendations for Computing Research in South Africa

Despite the undoubted progress and quality of South African computing research there remains a number of key areas in which improvements can be achieved. In particular, while growth in output is outpacing total research output in South Africa which is, itself, outpacing global averages (NRF, 2018), arrangements need to be put in place to ensure the sustainability of this growth. Additionally, while there are a number of highly productive institutions, actions necessary to improve the output of less productive institutions need to be identified. Moreover, while there are a number of highly cited works produced by South African computing researchers, more emphasis needs to be placed on improving the overall quality of research produced. Unfortunately, the current subsidy model applied by the DHET does, to some extent, work against efforts to ensure greater quality in emphasising quantity of research output units. Moreover, the DHET subsidy system does not encourage or support international collaboration (Woodiwiss, 2012). Given the central role of funding in the production and publication of research, it is necessary to acknowledge the changing funding landscape in South Africa. Increasingly, in the face of changes in how the DHET and NRF fund researchers and research projects, researchers need to seek funding from alternative local and international sources.

It is acknowledged that teaching and research are intimately linked. It is recommended, therefore, that researchers seek to achieve a greater alignment between their teaching and their research, especially in the case of postgraduate teaching. As Kroeze, Pretorius and Roode (2010, p. 2) notes, "in academia the future is research, both for academic staff and students". Consequently, a key area of focus for improving both the quantity and quality of computing research in South Africa should be the targeting of increased student throughput from undergraduate programs through to higher levels of postgraduate study where, typically, research skills and output are emphasised. It is also recommended that, for doctoral programmes, institutions consider adopting the article model for theses to promote processes of external feedback and continued research productivity. Further research is of course necessary to determine the effects of these recommendations on the quality and quantity of research outputs.

Given the indication that the network of collaboration is particularly sparse and decentralised, and previous findings showing positive associations between research collaboration and citation impact (Larivière, Gingras, Sugimoto & Tsou, 2015), it is recommended that researchers increase the extent of co-supervision in masters and doctoral programmes and, where possible, that such relationships extend across institutions (both local and international). In particular, it is recommended that researchers allocate explicit effort to developing stronger inter-institutional partnerships, both locally and internationally, especially for burgeoning research areas like data science and artificial intelligence. Despite the potential value of such collaborations, it is acknowledged that the current research subsidy model presents a substantial roadblock in this regard. In addition to inter-institutional collaboration, a greater emphasis on intra-institutional collaboration is recommended. Specifically, the establishment of research groups involving both established and junior academics as well as students will facilitate communities of practice which can lead to higher quality research outputs (Kroeze et al., 2010). Considering regional collaboration, it is recommended that in regions with a number of institutions (e.g., Gauteng or the Western Cape), regular regional colloquia be established to provide increased opportunities for communication, collaboration and feedback.

Finally, as is the case in, for instance, Economics it is recommended that the local conferences SAICSIT and SACLA place a greater emphasis on implementing working paper series. Through increased feedback and discussion, it is argued, the quality of final research outputs will be improved. In the absence of such opportunities, as has been the case in recent years with SAICSIT, where appropriate, the conversion of conference papers to journal articles should be encouraged. As the findings indicate such articles typically have a greater impact in terms of citations.

 

6 CONCLUSION

The scientometric investigation ofSouth African computing research presented in this studyprovides, firstly, an overview of the landscape of South African computing research, secondly, an understanding of where strengths lie and, thirdly, an identification of areas for which work is required to grow the quality and quantity of research outputs.

Despite the value of the the findings presented in this report, it is necessary to briefly acknowledge a number of limitations to the present investigation. First, as a consequence of collecting data from only a single bibliographic database, the data do not necessary present a complete representation of all South African computing research. Despite the breadth in coverage of the SCOPUS citation enhanced bibliographic database, gaps in coverage do exist, especially with regards to citation data. In a related manner, given the nature of the search query employed, the coverage of subject areas was particularly broad and, unfortunately, did not necessarily conform to the definition of computing adopted in this study or include all possible sub-fields. A third limitation relates to the years under consideration. Only data from reports published between 2008 and 2017 (inclusive) were considered. Consequently, the findings do not provide any insight into publication patterns before or after the period considered. Finally, as the study merely considered data relating to publication, collaboration, and citation patterns, identification of factors either driving or hindering publication quality or growth was not possible.

Notwithstanding these limitations, the study findings are broadly of interest to researchers across computing sub-disciplines in South Africa. While a number of specific recommendations have been made, further interpretations of the findings and, on this basis, suggestions for practice may arise which can further aid South African computing researchers in producing a healthy volume of high quality research outputs. Future investigations of South African computing research should endeavour to compare such patterns with, firstly, other countries and, secondly, other research domains within South Africa. It is also recommended that a temporal approach be adopted, with changes in research patterns over time being considered. Additionally, it may be of value to consider alternative metrics (e.g., Altmetric, social media sharing, traditional media attention, patents, or industry consulting or collaboration) when studying the broader societal impact of South African computing research.

Additionally, it is noted that in many cases the teaching loads facing researchers present significant barriers to research progress in terms of both quantity and quality. Consequently, it is argued that reductions in teaching loads through the re-balancing of teaching responsibilities, the use of online resources, assistants and, if need be, external consultants can positively contribute to research output. To determine if this is the case, longitudinal studies assessing the effects of re-balancing teaching responsibilities on research output should be conducted.

In addition to these recommendations, to provide further insight into institutional differences in research output, it is recommended that an inter-institutional qualitative study be conducted. It may be useful to select a sample of highly productive, moderately productive, and relatively unproductive institutions and interview associated researchers to identify possible drivers and barriers for their productivity. In addition to providing insights into possible drivers of research output, such an investigation will enable the assessment of the proposition that achieving a greater alignment between teaching activities and research activities will facilitate improvements in the quality and quantity of research outputs, as well as the proposition that changes in postgraduate supervision and thesis structures will lead to increased research output.

 

References

Belter, C. W. (2015). Bibliometric indicators: Opportunities and limits. Journal of the Medical Library Association : JMLA, 103(4), 219-221. http://dx.doi.org/10.3163/1536-5050.103.4O14        [ Links ]

Bonacich, P (1987). Power and centrality: A family of measures. American Journal of Sociology, 92(5), 1170-1182.         [ Links ]

Boshoff, N. & Akanmu, M. A. (2018). Scopus or Web of Science for a bibliometric profile of pharmacy research at a Nigerian university? South African Journal of Libraries and Information Science, 83(2), 12-22. http://dx.doi.org/10.7553/83-2-1682        [ Links ]

Bradford, S. (1934). Sources of information on specific subjects. Engineering: An Illustrated Weekly Journal, 137, 85-86.         [ Links ]

Brown, I. & Tanner, M. (2008). The international visibility of South African IS research: An author-affiliation analysis in the top IS-Ranked IS-Centric journals. South African Computer Journal, 42, 14-20.         [ Links ]

Cahlik, T. (2000). Comparison of the maps of science. Scientometrics, 49(3), 373-387. http://dx.doi.org/10.1023/A:1010581421990        [ Links ]

Callon, M., Courtial, J. P & Laville, F. (1991). Co-word analysis as a tool for describing the network of interactions between basic and technological research: The case of polymer chemsitry. Scientometrics, 22(1), 155-205. http://dx.doi.org/10.1007/BF02019280        [ Links ]

Cloete, N., Sheppard, C. & Bailey, T. (2015). South Africa as a PhD Hub? In N. Cloete, P Maassen & T. Bailey (Eds.), Knowledge production and contradictory functions in African higher education. (Chap. 5, pp. 75-108). Cape Town: African Minds.         [ Links ]

Cobo, M., López-Herrera, A., Herrera-Viedma, E. & Herrera, F. (2011). An approach for detecting, quantifying, and visualizing the evolution of a research field: A practical application to the fuzzy sets theory field. Journal of Informetrics, 5(1), 146-166. http://dx.doi.org/10.1016/j.joi.2010.10.002        [ Links ]

Costas, R. & Bordons, M. (2007). The h-index: Advantages, limitations and its relation with other bibliometric indicators at the micro level. Journal of Informetrics, 1(3), 193-203. http://dx.doi.org/10.1016/j.joi.2007.02.001        [ Links ]

David, E. & Jon, K. (2010). Networks, crowds, and markets: Reasoning about a highly connected world. New York, NY, USA: Cambridge University Press.         [ Links ]

Department of Higher Education and Training (DHET). (2015). Research outputs policy, 2015. Government Gazette, 597(38552), 1-24.         [ Links ]

Elsevier. (2018). Scopus: Content coverage guide 2018. Elsevier Publishing.

Freeman, L. C. (1978). Centrality in social networks conceptual clarification. Social Networks, 1 (3), 215-239. http://dx.doi.org/10.1016/0378-8733(78)90021-7        [ Links ]

Galliers, R. D. & Whitley, E. A. (2007). Vive les differences? Developing a profile of European information systems research as a basis for international comparisons. European Journal of Information Systems, 16(1), 20-35. http://dx.doi.org/10.1057/palgrave.ejis.3000662        [ Links ]

Georg, C.-P & Rose, M. E. (2016). Mirror, mirror on the wall, who is the most central of them all? (Working Papers No. 571). Economic Research Southern Africa. Retrieved from https://EconPapers.repec.org/RePEc:rza:wpaper:571

Glass, R. L., Ramesh, V & Vessey, I. (2004). An analysis of research in computing disciplines. Communications of the ACM, 47(6), 89-94. http://dx.doi.org/10.1145/990680.990686        [ Links ]

Gruner, S. (2007). Letter to the Editor: What is Informatics ? South African Computer Journal, 39, 54-55.         [ Links ]

Kotzé, P & Van Der Merwe, A. (2009). The research foci of computing research in South Africa as reflected by publications in the South African Computer Journal. South African Computer Journal, (44). http://dx.doi.org/10.18489/sacj.v44i0.24        [ Links ]

Kourie, D. G. (2010). The South African Computer Journal: 1989 to 2010. Transactions of the Royal Society of South Africa, 65(2), 107-111. http://dx.doi.org/10.1080/0035919X.2010.510664        [ Links ]

Kroeze, J. H., Pretorius, P D. & Roode, J. D. (2010). The role of an IT research professor in South Africa. Sprouts: Working Papers on Information Systems, 10(32).         [ Links ]

Larivière, V, Gingras, Y., Sugimoto, C. R. & Tsou, A. (2015). Team size matters: Collaboration and scientific impact since 1900. Journal of the Association for Information Science and Technology, 66(7), 1323-1332. http://dx.doi.org/10.1002/asi.23266        [ Links ]

Lotka, A. J. (1926). The frequency distribution of scientific productivity. Journal of the Washington Academy of Sciences, 16(12), 317-323.         [ Links ]

Lowry, P., Romans, D. & Curtis, A. (2004). Global journal prestige and supporting disciplines: A scientometric study of Information Systems journals. Journal of the Association for Information Systems, 5(2), 29-80. http://dx.doi.org/10.17705/1jais.00045        [ Links ]

Nomikos, G., Pantazopoulos, P, Karaliopoulos, M. & Stavrakakis, I. (2014). Comparative assessment of centrality indices and implications on the vulnerability of ISP networks. In 2014 26th international teletraffic congress (ITC) (pp. 1-9). http://dx.doi.org/10.1109/ITC.2014.6932932

NRF. (2018). SA growth of research outputs surpasses world average. Science Matters, 1 (2), 13. Retrieved from https://www.nrf.ac.za/NRF_SMMag/Vol1/Issue2/index.html#p=13        [ Links ]

Okamoto, K., Chen, W. & Li, X.-Y. (2008). Ranking of closeness centrality for large-scale social networks. In International workshop on frontiers in algorithmics (pp. 186-195). Springer. http://dx.doi.org/10.1007/978-3-540-69311-6_21

Opsahl, T., Agneessens, F. & Skvoretz, J. (2010). Node centrality in weighted networks: Generalizing degree and shortest paths. Social Networks, 32(3), 245-251. http://dx.doi.org/10.1016/j.socnet.2010.03.006        [ Links ]

Page, L., Brin, S., Motwani, R. & Winograd, T. (1999). The PageRank citation ranking: Bringing order to the web. (tech. rep. No. SIDL-WP-1999-0120). Stanford InfoLab. Retrieved from http://ilpubs.stanford.edu:8090/422/1/1999-66.pdf

Pao, M. L. (1986). An empirical examination of Lotka's law. Journal of the American Society for Information Science, 37(1), 26-33.         [ Links ]

Perron, B. E., Victor, B. G., Hodge, D. R., Salas-Wright, C. P, Vaughn, M. G. & Taylor, R. J. (2017). Laying the foundations for scientometric research: A data science approach. Research on Social Work Practice, 27(7), 802-812. http://dx.doi.org/10.1177/1049731515624966        [ Links ]

Pouloudi, N., Poulymenakou, A. & Pramatari, K. (2012). A profile of information systems research in the Mediterranean region. European Journal of Information Systems, 21 (4), 345-357. http://dx.doi.org/10.1057/ejis.2012.31        [ Links ]

Sanders, I. D. & Alexander, P M. (2015). A study of computing doctorates in South Africa from 1978 to 2014. South African Computer Journal, (57), 58-89. http://dx.doi.org/10.18489/sacj.v0i57.294        [ Links ]

Sellitto, C. (2007). A study of journal publication diversity within the Australian information systems sphere. Australasian Journal of Information Systems, 14(2), 19-42. http://dx.doi.org/10.3127/ajis.v14i2.470        [ Links ]

Shackelford, R., McGettrick, A., Sloan, R., Topi, H., Davies, G., Kamali, R., ... Lunt, B. (2006). Computing curricula 2005: The overview report. In Proceedings of the 37th SIGCSE Technical Symposium on Computer Science Education (pp. 456^57). SIGCSE '06. http://dx.doi.org/10.1145/1121341.1121482

Turpin, M. (2018). Assessing South African ICT4D research outputs: A journal review. South African Computer Journal, 30(1), 108-127. http://dx.doi.org/10.18489/sacj.v30i1.541        [ Links ]

University of Edinburgh School of Informatics. (2016). What is Informatics? Last accessed 4 April 2019. Retrieved from https://www.ed.ac.uk/files/atoms/files//what20is20informatics.pdf

van der Walt, S., Colbert, S. C. & Varoquaux, G. (2011). The NumPy array: A structure for efficient numerical computation. Computing in Science & Engineering, 13(2), 22-30. http://dx.doi.org/10.1109/MCSE.2011.37        [ Links ]

van Biljon, J. & Naude, F. (2018). Collaboration towards a more inclusive society: The case of South African ICT4D researchers. In D. Kreps, C. Ess, L. Leenen & K. Kimppa (Eds.), This changes everything - ICT and climate change: What can we do? (pp. 82-94). http://dx.doi.org/10.1007/978-3-319-99605-9_6

Wasserman, S. & Faust, K. (1994). Social network analysis: Methods and applications. Structural Analysis in the Social Sciences. Cambridge: Cambridge University Press.         [ Links ]

Woodiwiss, A. J. (2012). Publication subsidies: Challenges and dilemmas facing South African researchers. Cardiovascular journal of Africa, 23(8), 421-427.         [ Links ]

Yan, E. & Ding, Y. (2009). Applying centrality measures to impact analysis: A coauthorship network analysis. Journal of the American Society for Information Science and Technology, 60(10), 21072118. http://dx.doi.org/10.1002/asi.21128        [ Links ]

 

 

Received: 8 Jan 2019
Accepted: 14 Mar 2019
Available online: 24 Jul 2019

 

 

1 Scientometrics, as a sub-discipline of Information Science, is broadly the quantitative study of scientific research activities. Incorporating bibliometric analyses, this includes, among others, citation analysis, publication analysis, collaboration analysis, and productivity analysis.
2 https://www.acm.org/publications/class-2012.
3 General and reference, Hardware, Computer systems organization, Networks, Software and its engineering, Theory of Computation, Mathematics of Computing, Information Systems, Security and Privacy, Human-centered Computing, Computing Methodologies, Applied Computing, and Social and Professional Topics.
4 ACM: https://www.acm.org/publications/computing-classification-system/1998/ccs98
5 While Kotzé and Van Der Merwe (2009) used the 1998 edition of this classification system, Sanders and Alexander (2015) used the 2012 edition (https://www.acm.org/publications/class-2012).
6 6 Information Technology for Development (ITD), Information Technologies and International Development (ITID), and the Electronic Journal on Information Systems in Developing Countries (EJISDC).
7 University of the Witwatersrand, University of Johannesburg, Sol Plaatje University, University of Mpumalanga, Mangosuthu University of Technology, Central University of Technology, University of Venda, University of Fort Hare, University of Limpopo, Durban University of Technology, Rhodes University, Tshwane University of Technology, University of South Africa, University of the Free State, University of Pretoria, North West University, Cape Peninsula University of Technology, University of the Western Cape, Nelson Mandela Metropolitan University, Nelson Mandela University, University of KwaZulu Natal, University of Zululand, University of Cape Town, Stellenbosch University.
8 Computer Science - Miscellaneous, Artificial Intelligence, Computational Theory and Mathematics, Computer Graphics and Computer-aided Design, Computer Networks and Communications, Computer Science Applications, Computer Vision and Pattern Recognition, Hardware and Architecture, Human-computer Interaction, Information Systems, Signal Processing, and Software.
9 Decision Sciences - Miscellaneous, information systems and Management, Management Science and Operations Research, and Statistics, Probability and Uncertainty.
10 Keywords automatically supplied by SCOPUS.
11 Bradford's Law states that: "If scientific journals are arranged in order of decreasing productivity of articles on a given subject, they may be divided into a nucleus of periodicals more particularly devoted to the subject and several groups or zones containing the same articles as the nucleus, when the number of periodicals in the nucleus and succeeding zones will be as 1 : n : n, where n is a multiplier" (Bradford, 1934).
12 A form of linear analog electronic filter.
13 Orthogonal Frequency Division Multiplexing.
14 Given the size of the graphs and the resolution of the article, it would not be possible to detect any meaningful content from the complete graphs at such a size.
15 It is noted that a fourth measure of centrality is frequently considered in network analysis-closeness centrality. This metric represents the average length of the shortest path between a vertex and all other vertices in the graph. However, in graphs with disconnected vertices (as is the case in this study) infinite distances are present and, therefore, this measure is not necessarily applicable (Opsahl, Agneessens & Skvoretz, 2010; Wasserman & Faust, 1994).
16 The top ten authors represented in the dataset at the time of data collection were: AP Engelbrecht (173); T Marwala (154); S von Solms (102); HC Ferreira (101); K Djouani (93); GP Hancke (92); A Bagula (83); Y Hamam (83); N Pillay (66); and N Ventura (64).

Creative Commons License All the contents of this journal, except where otherwise noted, is licensed under a Creative Commons Attribution License